Variable Sized Partitions for Range Query Algorithms
نویسندگان
چکیده
A range query applies an aggregation operation over all selected cells of an OLAP data cube where selection is specified by the range of contiguous values for each dimension. Many works have focused on efficiently computing range sum or range max queries. Most of these algorithms use a uniformly partitioning scheme for the data cube. In this paper, we improve on query costs of some of these existing algorithms by noting two key areas. First, end-user range queries usually involve repetitive query patterns, which provide a variable sized partitioning scheme that can be used to partition the data cubes. Query costs are reduced because pre-computation is retrieved for entire partitions, rather than computed for a partial region in many partitions, which requires large amounts of cell accesses to the data cube. Second, data in the data cube can be arranged such that each partition is stored in as few physical storage blocks as possible, thus reducing the I/O costs for answering range queries.
منابع مشابه
مدل جدیدی برای جستجوی عبارت بر اساس کمینه جابهجایی وزندار
Finding high-quality web pages is one of the most important tasks of search engines. The relevance between the documents found and the query searched depends on the user observation and increases the complexity of ranking algorithms. The other issue is that users often explore just the first 10 to 20 results while millions of pages related to a query may exist. So search engines have to use sui...
متن کاملOn the Difficulty of Range Searching
We consider the general problem of (2-dimensional) range reporting allowing arbitrarily convex queries. We show that using a traditional approach, even when incorporating techniques like those used in fusion trees, a polylogarithmic query time cannot be achieved unless more than linear space is used. Our arguments are based on a new non-trivial lower bound in a model of computation which, in co...
متن کاملWised Semi-Supervised Cluster Ensemble Selection: A New Framework for Selecting and Combing Multiple Partitions Based on Prior knowledge
The Wisdom of Crowds, an innovative theory described in social science, claims that the aggregate decisions made by a group will often be better than those of its individual members if the four fundamental criteria of this theory are satisfied. This theory used for in clustering problems. Previous researches showed that this theory can significantly increase the stability and performance of...
متن کاملComparative Approaches of Query Optimization for Partitioned Tables
Due to vast use of Internet data grows explosively. A query that is fired on table may require a complete table scan which can take a long time as it has to inspect every row in table. Since, there is no way to identify this problem, becomes more sever for historical tables for which many queries concentrate, access on rows that were generated recently. Partition helps to solve this problem. Pa...
متن کاملA Mathematical Model and Grouping Imperialist Competitive Algorithm for Integrated Quay Crane and Yard Truck Scheduling Problem with Non-crossing Constraint
In this research, an integrated approach is presented to simultaneously solve quay crane scheduling and yard truck scheduling problems. A mathematical model was proposed considering the main real-world assumptions such as quay crane non-crossing, precedence constraints and variable berthing times for vessels with the aim of minimizing vessels completion time. Based on the numerical results, thi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002